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Abstract.  To  provide  a  logic  for  reasoning  about  concurrently  executing  programs,  Abra- 
hamson  has  defined  an  extension  of  propositionzd  dynamic  logic  (PDL)  by  allowing  inter¬ 
leaving  as  an  operator  for  combining  programs,  in  addition  to  the  regular  PDL  operators 
union,  concatenation,  and  star.  We  show  that  the  satisfiability  problem  for  interleaving  PDL 
is  complete  for  deterministic  double-exponential  time,  and  that  this  prob’-'m  requires  time 
double-exponential  in  cn/logn  for  some  positive  constant  c.  Moreover,  this  lower  bound 
holds  even  when  restricted  to  formulas  where  each  program  appearing  in  the  formula  has  the 
form  oi  1 02  I  ...  la*  where  |  denotes  the  interleaving  operator  and  where  oi, . . . ,  o*  are  reg¬ 
ular  programs,  i.e.,  programs  built  firom  atomic  programs  using  only  the  regular  operators. 
Another  consequence  of  the  method  used  to  prove  this  result  is  that  the  equivalence  problem 
for  regular  expressions  with  interleaving  requires  space  and  that  this  lower  bound 

holds  even  to  decide  whether  {Ei  |  JE?2  1  ...  | .E*) U F  =  E*  where  E\,. . .  ,Ek,F  are  ordinary 
regular  expressions;  this  improves  a  previous  result  of  the  authors.  Moreove the  same  lower 
bound  holds  for  the  containment  problem  for  expressions  of  the  form  Fi  |  F2  |  ...  |  Fj^. 
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1  Introduction 


Propositional  dynamic  logic  (PDL)  was  defined  by  Fischer  and  Ladner  [4]  as  a  formal  system 
for  reasoning  about  programs.  It  is  a  propositional  version  of  a  first-order  dynamic  lofric 
introduced  earlier  by  Prat<:  [15].  In  PDL,  the  regulu  operators  (union,  concatenaticu,  and 
Kleene  star)  are  used  as  operators  for  constructing  programs.  If  a  amd  h  are  programs,  then 
a  U6  means  to  nondeterministically  run  either  a  or  5,  a;  6  (e.g.,  concatenation  of  programs) 
means  to  run  a  followed  by  6,  and  a*  means  to  run  a  any  finite  number  of  times.  The 
satisfiability  problem  for  PDL  is  known  to  be  complete  for  deterministic  exponential  time 
[4,  16].  The  efiect  on  complexity  of  using  different  formalisms  for  writing  programs  has 
been  studied,  for  example,  by  Abrahamson  [1]  for  programs  with  Boolean  variables,  and  by 
Harel,  Rosner,  and  Vardi  [7]  for  programs  specified  by  finite-state  automata  using  various 
concurrency  mechanisms  such  as  existential  branching,  universal  branching,  and  boimded 
cooperative  (e.g.,  communicating)  concurrency.  A  recent  survey  on  logics  of  programs, 
including  PDL,  is  given  by  Kozen  and  Tiuryn  [11].  The  survey  by  Harel  [6]  concentrates 
on  complexity  and  decidability  for  variants  of  PDL. 

To  permit  reasoning  about  concurrently  executing  programs,  Abrahamson  [1]  has  ex¬ 
tended  PDL  by  including  interleaving  as  an  operator  on  programs.  For  example,  if  a,  5, 
and  c  are  atomic  programs,  possible  executions  of  the  program  a*  |  (6;  c)  are  abac,  bcaaa, 
and  baaaca.  A  complete  definition  of  interleaving  PDL  (IPDL)  appears  in  Section  3.  Build¬ 
ing  on  Fischer  and  Ladner’s  [4]  nondeterministic  exponential  time  decision  procedure  for 
PDL,  Abrahamson  [1]  shows  that  the  satisfiability  problem  for  EPDL  can  be  decided  in 
nondeterministic  double-exponential  time.  Using  a  result  of  Pratt  [17]  and  Harel  and  Sher¬ 
man  [8],  the  upper  bound  can  be  improved  to  deterministic  double-exponential  time.  Our 
main  result  is  that  the  satisfiability  problem  for  IPDL  is  complete  for  deterministic  doul'V 
exponential  time  (2-EXPTIME),  and  that  a  lower  bound  on  time  is  double-exponential  in 
cn/logn  for  some  constant  c  >  0.  Moreover,  to  prove  the  lower  bound,  we  do  not  need 
the  full  power  of  IPDL  which  allows  the  interleaving  operator  to  be  arbitrarily  nested  with 
the  other  operators.  The  lower  bound  holds  even  when  restricted  to  formulas  where  each 
program  appearing  in  the  formula  has  the  form  oi  ]  02 1  ...  |  oj^  where  j  denotes  the  interleav¬ 
ing  operator  and  where  ai,...,Ofc  are  regular  programs,  i.e.,  programs  built  &om  atomic 
programs  using  only  the  regular  operators  -  union,  concatenation  and  star. 

As  noted  above,  Harel,  Rosner  and  Vardi  [7]  have  previously  studied  the  complexity  of 
PDL  under  various  models  of  concurrency.  Among  the  many  results  in  [7],  the  one  which 
is  closest  in  spirit  to  our  result  is  that  PDL  is  complete  for  2-EXPTIME  if  programs  are 
specified  by  concurrent  automata.  There  are,  however,  differences  between  the  concurrent 
automata  model  and  the  interleaving  mod<^l.  One  difference  is  that  we  use  expressions 
while  [7]  uses  automata  (and  it  is  known  that  automata  can  express  certain  languages  much 
more  succinctly  that  expressions  [3]).  Another  difference  is  that  the  conciirrent  automata 
model  corresponds  to  synchronous  concurrent  execution  with  communication,  whereas  the 
interleaving  model  corresponds  more  closely  to  asynchronous  concurrent  execution  without 
communication. 

The  proof  of  our  lower  bound  rests  on  showing  how  regular  expressions  with  interleaving 
can  succinctly  encode  Turing  machine  computations.  Using  the  same  encoding,  we  improve 
a  result  of  [12].  The  Non-Empty  Complement  (NEC)  problem  for  a  class  of  expressions 
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is  the  problem  of  deciding,  for  a  given  expression  E  over  alphabet  whether  E  does 
not  describe  all  words  in  £*.  It  is  shown  in  [12]  that  the  NEC  problem  is  exponential- 
space-complete  for  regular  expressions  with  interleaving,  and  that  this  problem  requires 
space  for  some  constant  c  >  0.  Here,  we  improve  this  result  in  two  ways:  first,  the 
lower  bound  is  improved  to  2®*/*®*";  second,  the  lower  boimd  holds  even  for  expressions 
of  the  form  {Ei  |  £'2  |  ...  ( Ek)  U  F  where  £1, . . . ,  £*,  F  are  ordinary  regular  expressions. 
The  best  known  upper  bound  is  space  so  there  is  still  a  gap.  Also  open  is  t  ie 

computationaJ  complexity  of  the  NEC  and  equivalence  problems  for  expressions  of  the  for:n 
£1 1  £2  I  ...  |£i.  We  do  show,  however,  that  the  containment  problem  for  expressions  of 
this  form  is  exponential-space-complete. 

2  Encoding  Turing  Machine  Computations 
by  Regular  Expressions  with  Interleaving 

We  assume  familiarity  with  regular  expressions  and  time  and  space  complexity;  see,  e.g., 
[9]  or  [20]  if  needed. 

The  interleaving  of  words  x  and  y,  denoted  zjy,  is  the  set  of  tdl  words  of  the  form 

xiyiX2y2  ■  • .  Xkyk 

where  x  =  «iZ2  . . .  **  and  y  =  yiy2 . . .  y*  and  where  the  words  and  y,-,  1  <  t  <  k,  can  be 
of  arbitrary  length  (including  the  empty  word).  If  X  and  Y  are  sets  of  words,  then  AT  |  T  is 
the  union  of  the  sets  z|y  over  all  z  €  AT  and  y  eY.  An  interleaving  expression  is  a  regular 
expression  which  can  contain  the  interleaving  operator,  in  addition  to  the  usual  operators 
union,  concatenation  and  star.  The  langu^e  L{E)  described  by  an  interleaving  expression 
£  is  defined  recursively  in  the  obvious  way;  in  particular,  L{Ei  \  £2)  =  I«(£i)  j  •£'(£2)-  By  a 
regular  expression  we  mean  a  regular  expression  as  usually  defined,  containing  only  union, 
concatenation  and  star.  Say  that  an  interleaving  expression  £  is  a  top-level  concurrent 
expression  if 

£  =  £ii£j|...  |£fc 

for  some  ^  >  1  and  some  regular  expressions  £1, . . . ,  Ek- 

We  define  below  a  particular  encoding  of  a  'Hiring  machine  computation  as  a  word  over 
a  finite  alphabet.  We  then  show  how  to  construct,  for  any  nondeterministic  Turing  machine 
M  with  space  boimd  2’^’*^  for  some  polynomial  p(n)  and  any  input  z,  an  interleaving 
expression  J  such  that  L{I)  contains  precisely  the  words  which  do  not  encode  accepting 
computations  of  M  on  input  z.  Moreover,  /  has  the  form  EUF  where  £  is  a  top-level 
concurrent  expression  and  £  is  a  regular  expression,  and  the  length  of  /  is  0(p(n)  log  n) 
where  n  is  the  length  of  z  (and  where  the  constant  &ctor  implicit  in  the  0-notation  depends 
on  A/). 

Before  getting  into  the  details,  it  is  useful  to  explain  the  main  idea  by  a  simple  exam¬ 
ple.  A  key  part  of  the  construction  is  a  top-level  concurrent  expression  which  can  identify 
identical  subwords  in  a  long  word,  provided  that  the  long  word  has  a  particular  restricted 
format.  We  illustrate  how  this  is  done.  Let  £  be  a  finite  dphabet,  and  let  6, cq,  ci, . . . ,  Cm-i 
be  symbols  not  in  £>.  If  u  is  a  word  with  length  divisible  by  m,  say  u  =  uqUi  •  •  •  Ux-i  where 
m  divides  2,  let  h(u)  be  the  word  obtained  by  placing  the  symbol  Cimodm  before  u,  for  all 
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».  Words  in  the  rtjstricted  format  are  those  in  i?  =  View  a  word  in  as  a 

concatenation  of  blocks,  where  a  block  is  any  subword  in  i.e.,  a  block  has  the 

form 

Co  do  Cl  di  ...  Cm-l  dtn-l  CQbc\b  ...  Cm-l  b 

for  some  do, ... ,  d^-i  €  D.  Let  P  be  the  set  of  words  w  €  R  such  that  (at  least)  two  blocks 
of  w  are  identical.  We  claim  that  the  following  top-level  concurrent  expression  A  of  length 
0(m),  when  restricted  to  words  in  R,  describes  precisely  the  words  having  two  identical 
blocks,  i.e.,  L{A)  nR  =  P. 

A=^Ao[Ai\... \A^ 

where,  for  0  <  fc  <  m  —  1, 

■^k=  U  c*  •  d  -  c*  •  6  •  c*  •  d  •  Cfc  •  i> 
deD 


and 


Am  =  (cq  ■  D  •  c\  •  D  ■  ■  •  Cm~i  •  D  ■  Co  •  b  ■  Cl  ■  b  ■  ■  ■  Cro-i  •  b)*. 


It  is  easy  to  see  that  every  word  w  6  P  belongs  to  L{A):  if  we  imagine  that  w  is 
scanned  from  left  to  right,  the  two  occurrences  of  the  repeated  block,  say  /i(dodi . . .  dm-ib'^), 
are  “parsed”  to  j4o,.Ai,...,i4m-i  where  the  d  in  the  union  for  At  matches  d^;  the  other 
blocks  are  parsed  to  ^4^.  In  the  other  direction,  suppose  that  w  6  L(A)  fl  i2,  so  u;  € 
L(wo  I  loi  1  ...  I  Wm-\)  where  Wk  €  L{Ak)  for  all  k.  A  key  observation  is  that  each  block 
of  w  must  be  either  parsed  entirely  to  Wm,  or  parsed  entirely  to  wo,...,Wm-i-  If  the 
observation  does  not  hold,  consider  the  first  block  for  which  it  fails.  If  we  start  by  parsing 
this  block  to  wo,  then  Wm  cannot  be  used  later  in  parsing  this  block  since  the  part  of  Wm 
that  has  not  been  used  yet  begins  with  cod  for  some  d  €  P.  In  the  other  case,  suppose  we 
start  by  parsing  this  block  to  Wm,  but  switch  to  tn*  when  parsing  the  subword  cjd,  where 
k  >l  and  d€  D.  Later  we  have  to  parse  the  subword  Ck-ib.  We  cannot  parse  this  subword 
to  Wm  since  the  part  of  to„,  that  has  not  been  used  yet  begins  with  Cfc.  We  cannot  parse 
this  subword  to  since  the  part  of  Wk-i  that  has  not  been  used  yet  begins  with  Ck-id 
for  some  d  £  D.  Given  this  observation,  it  is  easy  to  see  that  w  must  have  two  identical 
blocks,  namely,  the  two  blocks  that  are  parsed  entirely  to  wo,wi, . .  .,Wm-i- 

We  now  return  to  the  detadls.  Let  M  be  a  nondeterministic  Turing  machine  with  space 
bound  where  p(n)  is  a  polynomial  and  p(n)  >  n.  Fix  an  input  x  and  let  n  be  the 
length  of  X.  Let  I  =  riog2P(n)l  and  m  =  2*,  and  note  that  p(n)  <m<  2p{n).  Let  s  =  2"*, 
so  s  is  at  least  as  large  as  the  space  bound  2’^'*).  Let  Q  be  the  set  of  states  of  M  amd  let  T 
be  the  tape  symbols.  An  ID  of  Af  is  a  word  of  length  s  -t- 1  in  T^QTT*.  The  meaning  of 
the  ID  aqa0  where  a,  €  T*,  <r  €  T,  and  9  €  Q,  is  that  acr/?  is  written  on  the  tape  and  M 
is  in  state  q  with  the  head  scanning  <r.  It  will  be  useful  to  use  a  redundant  representation 
of  an  ID.  If 


a,  =  o«,oo»,i  ■ . . 

is  the  tth  ID  in  a  computation,  this  is  represented  by  the  word 


K1K2  •  •  • 

where  fcjj  =  for  1  <  j  <  s  -  1.  Let  A  =  (Q  U  T)^  be  the  alphabet  of 

symbols  used  in  the  redundant  representation. 
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As  in  [5],  we  use  ‘‘marked  binary  numbers”  to  index  the  symbols  of  an  ID.  A  marked 
binary  number  is  a  word  over  the  alphabet  {0,Q,  1,1}  in  the  lamguage  described  by  the 
expression  (0  U  1)* IQ*  UQ*;  i.e.,  the  rightmost  (lowest  order)  1  is  marked,  as  well  as  all  O’s 
to  the  right  of  this  1;  and  in  the  representation  of  0,  all  O’s  are  marked.  For  0  <  k  <  m  —  1, 
let  [k]  denote  the  length-1  marked  binary  representation  of  k.  Call  these  the  low-level 
numbers.  For  0  <  j  <  s  —  1,  let  ([;])  denote  the  length-m  mau'ked  binary  representation  of 
j.  Call  these  the  high-level  numbers.  It  is  useful  to  use  different  symbols  for  the  digits  in 
the  two  types  of  numbers,  say,  {0,1, Q,l}  for  the  low-level  and  (O',  l',Q',l'}  for  the  high- 
level.  The  marking  allows  the  successor  relation  to  be  tested  locally  as  follows.  Define 
succ(O)  =  succ(Q)  =  {0,1}  and  8ucc(l)  =  succ(l)  =  {1,Q}.  If  yi . . .  yj  =  [k],  and  z  =  zi . . .  zj 
is  a  marked  binary  number  of  length  /,  then  z  =  (k  -f  1  mod  m]  iff  z,  €  8ucc(yi)  for  1  <  i  <  1. 
Similarly,  the  successor  relation  for  the  high-level  numbers  can  be  checked  locally. 

The  idea  is  that  we  use  high-level  numbers  to  number  the  symbols  of  ID’s  obtaining  some 
word  o',  and  then  use  the  low-level  numbers  to  number  the  symbols  of  a'.  The  low-level 
numbering  is  done  as  follows.  If  tu  is  a  word  with  length  divisible  by  m,  say  w  =  ooci . . . 
where  m  divides  z,  let  g(w)  be  the  word  obtained  from  w  by  placing  the  word  2(*  mod  m]3 
before  for  all  i.  I.e., 

g(w)  =  2(0]3ao  2(l]3ai  . . .  2[m  -  l)3(r,n-i  2{0]3ffm  . . .  2[m  -  l]3ir,_i . 

(The  word  2[k]3  plays  the  role  of  the  symbol  c*  in  the  simple  example  above.) 

An  accepting  computation  of  M  on  input  x  is  represented  by 

a  =  g(a') 


where 


a'  =  [ioii#"*i[iii6s:i((2ps:2...([5-ii]6s:.-i 

[[01]  #”‘[[ipri{i2pu  •••([* (1) 

[[0)1  #"•  [[111  [[211 6r2  •  •  •  [[*  - 1]]  f>z-i  [[Oil 

where,  for  0  <  i  <  t,  the  word  6i, 16^,2 . . .  is  the  redundant  representation  of  the  tth  ID 

in  the  computation  of  M  on  input  x,  and  the  accepting  state  appears  in  bt,ibt,2  •  >  • 

A  word  a  has  the  correct  framework  if  a  =  y(a')  for  some  a'  of  the  form  (1)  where 
the  can  be  any  symbols  of  A  which  are  locally  consistent  within  the  same  ID,  i.e.,  if 
then  bij+i  =  (a2,<r3,a4)  for  some  04. 

There  is  a  regular  expression  F  of  length  0(p(n)  log  n)  which  describes  all  words  which 
(1)  do  not  have  the  correct  framework,  or  (2)  do  not  contain  the  accepting  state,  or 
(3)  60,1 . . .  6o,«-i  does  not  represent  the  initial  ID  on  input  x.  The  construction  of  F  is  fairly 
straightforward,  although  tedious,  using  standard  methods  as  in,  for  example,  [5, 13, 18, 19]. 
F  is  written  as  a  union  of  “mistakes”  which  cause  a  word  to  violate  (1),  (2),  or  (3).  The 
expression  has  length  0{ml),  i.e.,  length  0(p(n)  log  n),  since  each  type  of  mistake  involves 
making  local  checks  in  a  where  the  region  of  locality  has  length  0(ml).  For  example,  letting 
D  =  {O',  l',Q',l'}  be  the  digits  used  in  high-level  numbers  and  E  =  AuDu{0, 1,Q,1, 2, 3,  #} 
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be  the  entire  alphabet,  the  following  expression  describes  the  mistake  that  the  high-level 
numbers  are  not  incremented  correctly: 

U  2*  .  d  •  E2m(l+3)-l  .  p  _  succ((i))  •  E*  . 
deD 

As  one  more  example,  we  write  an  expression  of  length  0(n)  which  (given  that  a  has  the 
correct  framework)  describes  the  mistake  that  . . .  6o,*-i  is  not  the  redundant  representa¬ 
tion  of  the  initial  ID,  qoxB*~*',  where  B  is  the  blank  tape  symbol.  Let  ci,C2, . . .  ,c„+i  €  A 
be  such  that  the  redundant  representation  of  the  initial  ID  is  cicj  . . .  Cn+i(B,  B,  B)*~^~^. 
Let  G  =  {0, 1,Q,  1, 2,3),  D  =  DUG,  A  =  AuG,  and  #  =  {#}  U G.  Let  5'*'  abbreviate 
SS*.  The  expression  is 

D+  •  #+  •  D+  .  ((A  -  {ci})  U  Cl  •  A+  •  D+  •  ((A  -  {cj})  U  C2  •  A+  •  D+  •  ( . . . 

...  ((A  -  {c„+i})  U  c„+i  ■  A+  .  D+  •  (A  U  Dy  ■  (A  -  {{B,  B,  B)}))) . . .)  •  E*  . 

The  rest  of  the  construction  of  F  is  left  to  the  interested  reader. 

The  more  interesting  part  of  the  construction  is  an  expression  E  which  describes  the 
mistake  that  the  symbols  bij  for  i  >  1  do  not  correspond  to  a  computation  of  M.  For 
6  €  A  where  b  contains  at  most  one  occurrence  of  a  state  symbol,  let  N{h)  be  the  set  of 
triples  which  could  occur  in  the  next  ID  at  the  same  position  as  h.  We  construct  a  top-level 
concurrent  expression  E  such  that,  when  restricted  to  words  a  having  the  correct  framework, 
a  €  L{E)  iff  6<+xj  ^  N{bij)  for  some  i  and  j.  The  expression  will  identify  pairs  (fcjj,6i+ij) 
using  the  fact  that  the  same  high-level  number  {[;]]  precedes  both  and  that 

there  is  exactly  one  occurrence  of  between  them.  Recall  that  D  =  {O', 


E  =  EolEi\... \Em 


where,  for  0  <  fc  <  m  -  1, 

=  U  U(2W3  -D-2[k]3-#)’ 

d€Vi€^ 

•(2[jfc]3  •  d  •  2(jfel3  •  b  •  2[jfc)3  •  D  •  2[k]3  •  #  •  2[lk]3  •  d  •  2[jk]3  •  (A  -  N{b)) ) 
•(2[fc)3.D-2(*l3-#)* 

and 


JEm  =  (2(0]3  •  D  •  2(1]3  •  D  •  •  •  2(m  -  1]3  •  D  •  2[0]3  •  A  •  2[1]3  •  A  •  •  •  2[m  -  1]3  •  A)*. 

We  now  argue  that  E  has  the  required  property.  Assuming  that  a  has  the  correct 
framework,  let  a  block  of  a  be  any  subword  of  the  form 

2(0)3  do  2(1)3  di . . .  2(m  -  1)3  d„,_i  2(0)3  a  2(1)3  o . . .  2(m  -  1)3  a 

for  some  do,..., dm-i  €  D  and  <r  €  A  U  {#}.  In  other  notation,  a  block  has  the  form 
^((O)]  ff"*)  for  some  j  and  a.  A  #-6fock  is  a  block  where  a  =  #. 

The  easier  direction  is  the  case  where  a  is  such  that  ^  N{bij)  for  some  i  and 
j.  Imagining  that  a  is  scanned  from  left  to  right,  we  describe  how  a  is  “parsed”  to 
Eo, El,..., Em-  In  the  expression  Eu,  we  refer  to  the  first  (resp.,  last)  occurrence  of  the 
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subexpression  (2[fc)3  •  D  •  2[fc]3  •  #)*  as  the  first  (resp.,  last)  part  of  Ek,  and  we  refer  to  the 
rest  of  Ek  as  the  middle  part.  Each  non-#-block  is  parsed  to  Em  and  each  #-block  is  parsed 
to  the  first  parts  of  Eq,  . . . ,  Em-i,  until  we  reach  the  block  ff(([j]]  fcjj).  At  this  point,  i;([[i]]) 
is  parsed  to  the  middle  parts  of  Eq,  Em-u  where  the  d  in  each  such  expression  matches 
the  corresponding  digit  of  [[;)].  Then  s(6Jj)  is  parsed  to  the  middle  parts  of  Eq,  Em-i, 
where  b  matches  bij.  The  following  non-#'blocks  are  parsed  to  Em,  the  next  #-block  is 
parsed  to  the  middle  parts  of  Eq, ..., Em-i,  the  following  non-#-blocks  are  parsed  to 
Em  up  to  the  block  <7([(j]]  bi+ij).  This  block  is  parsed  to  the  middle  parts  of  Eq,  ...,  Em-i 
(so  the  middle  parts  of  £^, . . . ,  Em-i  are  now  used  up).  Each  remaining  non-#-block  is 
parsed  to  Em  and  each  remaining  #-block  is  parsed  to  the  last  parts  of  Eq,  Em-i- 
In  the  other  direction,  if  a  6  L{E),  it  can  be  seen  that  this  is  the  only  way  a  parse 
can  proceed.  Let  a  €  L(wq\wi  |  ...  ItOm)  where  Wk  €  L{Ek)-  A  first  observation  is  that 
each  subword  2[k]3a,  where  o  €  D  U  A  U  {#},  must  be  parsed  entirely  to  a  single  word, 
either  Wk  or  Wm-  A  second  key  observation  is  that  each  block  must  be  either  parsed  entirely 
to  Wm,  or  parsed  entirely  to  WQ,...,Wm-\-  Given  the  first  observation,  the  argument  for 
the  second  observation  is  exactly  as  in  the  simple  example  above,  and  we  do  not  repeat 
it.  Consider  now  the  first  block  of  a  which  is  parsed  to  the  middle  parts  of  Eq)  •  •  •  >  Em-\- 
(There  must  be  such  a  block  since  the  middle  parts  of  Eq,  . . . ,  Em-\  must  be  used.)  Say 
that  this  block  is  ^([[j]]  6^-).  This  determines  &  d  =  dk  and  a  6  =  6ik  in  the  two  unions  for 
each  Ek  {0  <  k  <  m  -  1)  where  dodi . . .  dm-i  =  [[j]]  bk  =  bij  for  all  k.  The  following 
blocks,  up  to  the  next  #-block  (call  this  #-block  0)  must  then  be  parsed  to  Em,  and  0 
must  be  parsed  to  the  middle  parts  of  EQ,...,Em-i-  Now  some  block  7  between  0  and 
the  next  #-block  after  0  must  be  parsed  to  the  middle  parts  of  Eo>  •  >  Em-i,  for  otherwise 
there  will  be  no  way  to  parse  the  next  #-biock  after  0.  Since  the  dj^’s  determine  [[7]],  we 
must  have  7  =  s((i;]]  6i+ij).  Since  bk  —  bij,  we  must  have  bi+ij  ^  N{bij). 

This  completes  the  construction  of  I  —  EUF  such  that  L{I)  E*  iff  Af  accepts  x. 

Let  EXPSPACE  denote  the  class  of  languages  which  can  be  recognized  in  space  2^**^  for 
some  polynomial  p(n).  Recall  that  the  non-empty  complement  (NEC)  problem  for  a  class 
of  expressions  is  the  problem  of  deciding,  given  an  expression  E  over  alphabet  E,  whether 
L{E)  ^  E*.  In  [12]  we  observe  that  the  NEC  problem  for  interleaving  expressions  can  be 
solved  in  space  2^^"^.  FVom  the  above  construction,  we  get  the  following. 

Theorem  2.1  The  non-empty  complement  problem  for  expressions  of  the  form  E  U  F, 
where  E  is  a  top-level  concurrent  expression  and  F  is  a  regular  expression,  is  EXPSPACE- 
complete.  There  is  a  constant  c>  0  such  that  no  Turing  machine  with  space  bound  2'*/*®*" 
can  solve  tiiis  problem. 

Remark.  By  using  the  "shuffle  resistant”  code  of  Warmuth  and  Haussler  [21]  (see  also 
Proposition  3.1  of  [12]),  this  theorem  remains  true  for  expressions  over  a  binary  alphabet 
E  =  {0,1}. 

Although  it  is  an  open  question  whether  the  NEC  or  equivalence  problems  are  EXPSPACE)- 
complete  for  top>level  concurrent  expressions,  it  follows  from  the  above  that  the  contain¬ 
ment  problem  (i.e.,  deciding  for  given  expressions  Ri  and  R2  whether  L{Ri)  C  L{R2))  is 
EXPSPACE-complete  for  top-level  concurrent  expressions. 
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Theorem  2.2  The  containment  problem  for  top-level  concurrent  expressions  is  EXPSPACE- 
complete.  There  is  a  constant  c  >  0  such  that  no  Turing  machine  vaith  space  bound  2”*^*°*  " 
can  solve  this  problem. 

Proof.  The  exponential-space  upper  bound  for  the  containment  problem  follows  easily  (like 
the  exponential-space  upper  bound  for  the  NEC  problem  in  [12])  by  converting  the  input 
expressions  R\  and  to  equivalent  (and  exponentially  larger)  nondeterministic  finite-state 
automata. 

To  prove  EXPSPACEJ-hardness,  let  F  and  E  =  Eo\Ei  \  ...  \  Em  be  the  expressions 
constructed  above,  where  F  and  the  Ei's  are  regular  expressions.  Let  b  and  c  be  symbols 
not  in  £.  Letting 

Ri  =  6"*'*'*  •  0"*+^  •  E* 

Ri  =  (6-EoUc)|(6-EiUc)|  ...  |(6.£,„Uc)|(6"*+^  •EUc"‘+^), 
it  is  easy  to  see  that  L{E  U  F)  =  E*  iff  LiRi)  C  L{R2).  I 

Remark.  A  similarity  between  the  interleaving  and  the  intersection  operators  is  that  the 
NEC  and  containment  problems  for  regular  expressions  extended  by  interleaving  have  the 
same  complexity  as  the  NEC  and  contaunment  problems  for  regular  expressions  extended 
by  intersection;  all  of  these  problems  are  EXPSPACE-complete.  For  expressions  with  in¬ 
tersection,  this  was  first  proved  by  Hunt  [10]  (see  also  Purer  [5]).  For  expressions  with 
interleaving,  this  was  proved  by  the  authors  in  [12].  In  fact,  the  proof  in  [12]  proceeds  by 
giving  a  reduction  from  the  NEC  problem  for  expressions  with  intersection  to  the  NEC 
problem  for  expressions  with  interleaving,  by  showing  how  interleaving  can  “simulate”  in¬ 
tersection  under  certain  conditions.  In  contrast,  for  expressions  in  the  restricted  forms  used 
in  Theorems  2.1  and  2.2,  replacing  interleaving  by  intersection  lowers  the  complexity  of  the 
problem:  the  NEC  and  containment  problems  for  expressions  of  the  form 


{El  nF2n...n£?fc)uF, 

where  Fi, . . . ,  F  are  regular  expressions,  can  be  solved  in  polynomial  space.  Since  these 
problems  contain  the  NEC  problem  for  regular  expressions  as  a  special  case  (where  k  =  1 
and  F  -  0),  these  problems  are  PSPACE-complete,  since  the  NEC  problem  for  regular 
expressions  is  PSPAC&complete  [13]. 

3  PDL  with  Interleaving 

We  review  Abrahamsons’s  [1]  definition  of  PDL  with  interleaving  added  as  a  program  con¬ 
structor.  We  call  this  logic  interleaving  PDL  to  avoid  confusion  with  other  definitions  of 
concurrent  PDL  in  the  literature,  e.g.,  [7, 14). 

We  begin  with  a  set  of  atomic  formulas  which  represent  propositional  variables  and 
a  set  f'o  of  atomic  programs  which  represent  indivisible  program  steps.  Syntactically,  if 
p  and  q  are  formulas  and  o  and  b  are  prc^ams,  then  p  V  9  and  -ip  are  formulas,  (a)p  is 
a  formula  meaning  “it  is  possible  to  rim  a  to  reach  a  state  in  which  p  is  true,”  a  U  6  is  a 
program  meaning  “run  either  3  or  6,”  a;  6  is  a  program  meaning  “run  a  followed  by  6,”  a*  is 
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a  program  meaning  ‘‘run  a  any  finite  number  of  times,”  p?  is  a  program  meaning  “continue 
iff  p  is  true,”  and  a|6  is  a  program  meaning  “run  a  and  6  concurrently.” 

A  model  is  a  triple  M  =  {W,  ir,  t)  where  W'  is  a  set  of  states;  for  each  atomic  formula  p, 
ir(p)  C  ly  is  the  set  of  states  in  which  p  is  true;  and  for  each  atomic  program  a,  r(a)  C  WxW 
is  the  set  of  state  transitions  of  a.  )r  is  extended  to  all  formulas  and  r  is  extended  to  all 
programs.  In  general,  t  is  a  set  of  computation  sequences,  i.e.,  a  subset  of  {W  xW^)*.  To 
extend  r,  the  sets  T(a  U6),  t(o;6),  r(a*),  and  r(a|6)  are  obtained  fi-om  r(a)  and  r(6)  by 
union,  cc  ncatenation,  star,  and  interleaving,  respectively.  r(p?)  is  the  set  of  all  (u,  u)  such 
that  u  €  v(p)-  Note  that  r(a)  can  contain  computation  sequences  such  as  (u,  v)(w,  z)  which 
do  not  make  sense  if  v  ^  w  and  if  a  is  run  alone.  We  must  include  such  sequences  since,  if  a 
is  interleaved  with  b,  the  program  b  could  make  the  transition  from  v  to  u>.  A  computation 
sequence  <r  is  legal  if,  whenever  (u,«)(u;,2)  is  a  subword  of  a,  then  v  =  w.  To  extend  tt, 
iripVq)  ~  ir(p)Utr(q)  and  ir(->p)  =  JV  —  n’(p).  Finally,  7r((o)p)  is  the  set  of  states  u  such  that 
either  there  exists  a  state  2  and  a  legal  computation  sequence  a  €  r(a)  such  that  2  €  ir(p) 
and  such  that  a  has  the  form  <y  =  (u,  )...(,  2),  or  c  G  t(o)  and  u  6  v[p). 

A  formula  is  test-free  if  it  contains  no  occurrence  of  “?”. 

A  formula  ^  is  a  top-level  concurrent  formula  if  each  program  appearing  in  (p  has  the  form 
oi  1 02  I  ...  I  Ofc  for  some  ifc  >  1  where  oi, . . . , ot  are  regular  programs,  i.e.,  these  programs 
contain  no  occurrences  of  the  interleaving  operator. 

A  formula  p  is  satisfiable  if  there  is  a  model  M  and  a  state  u  such  that  u  G  x(^). 

Pratt  [17]  and  Harel  and  Sherman  [8]  show  that  the  satisfiability  problem  for  PDL 
(without  interleaving)  can  be  decided  in  deterministic  exponential  time  even  if  programs  are 
described  by  nondeterministic  finite-state  automata  (NFA’s)  instead  of  regular  expressions. 
By  a  straightforward  cross-product  construction  (see,  e.g.,  [12,  §3]),  any  regular  expression 
with  interleaving  can  be  converted  to  an  exponentially  larger  NFA.  It  follows  that  the 
satisfiability  problem  for  interleaving  PDL  belongs  to  2-EXPTIME  (the  class  of  languages 
which  can  be  recognized  by  deterministic  Turing  machines  in  time  double-exponential  in 
p(n)  for  some  polynomial  p(n)).  This  gives  the  upper  boimd  part  of  the  following  theorem. 

Theorem  3.1  The  satisfiability  problem  for  interleaving  PDL  is  complete  for  i-EXPTIME, 
even  when  restricted  to  top-level  concurrent  formulas  which  are  test-free.  There  is  a  constant 
c  >  0  such  that  no  deterministic  Turing  machine  with  time  bound  can  solve  this 

problem. 

Proof.  To  prove  2-EXPTIME-hardness,  we  use  expressions  similar  to  the  ones  constructed 
in  the  previous  section.  But  since  a  model  of  PDL  is  a  directed  graph  rather  than  just  a 
sequence,  we  can  simulate  a  2^")  space-bounded  alternating  T\iring  machine  (ATM)  rather 
than  a  2^^**)  space-bounded  nondeterministic  Turing  machine.  This  idea  was  first  used  by 
Fischer  and  Ladner  [4]  (with  a  linear  rather  than  an  exponential  space  bound),  and  has  been 
used  in  many  other  papers  on  the  complexity  of  propositional  program  logics.  Familiarity 
with  the  ATM  model  is  assumed  [2].  Recall  that  every  language  in  2-EXPTIME  is  accepted 
by  some  ATM  with  space  bound  2^”)  for  some  polynomial  p(n).  Let  Af  be  such  an  ATM.  We 
can  assume  that  M  begins  in  an  existential  state,  existential  and  univosal  states  alternate 
at  each  step,  and  M  has  exactly  two  possible  moves  at  each  step. 

The  expression  I  of  the  previous  section  is  modified  to  describe  all  strings  which  do  not 
represent  valid  computation  paths  of  M  on  input  x.  A  valid  computation  path  is  a  sequence 
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of  ID’S  which  begins  with  the  initial  ID  on  input  x,  ends  with  an  accepting  ID,  and  such 
that  each  ID  follows  from  the  previous  one  by  the  rules  of  Af .  Instead  of  the  single  inter-ID 
marker  #,  we  use  four  markers  #c  for  c  €  {1,2}  and  r  €  {e,u}.  The  subscript  c  indicates 
whether  the  first  or  second  choice*  is  taken  in  going  from  the  ID  preceding  the  marker  to 
the  ID  following  the  marker,  and  the  superscript  r  indicates  whether  the  ID  preceding  the 
marker  is  existential  or  universal. 

Therefore,  the  sequence  of  marker  superscripts  in  (the  representation  of)  a  valid  com¬ 
putation  path  must  be  ueueue ...  (it  starts  with  u  because  a  marker  precedes  the  first  ID  in 
our  construction).  The  expression  F  for  framework  errors  contains  additional  expressions 
for  strings  not  of  this  form.  Let  F‘  denote  F  including  these  additions. 

The  expression  E  for  computation  errors  is  modified  to  take  into  account  the  subscript  c 
in  the  unique  occurrence  of  a  #-block  between  the  ID  containing  bij  and  the  ID  containing 
bi+ij.  More  precisely,  for  a  triple  6  €  A,  let  Ni{b)  (resp.,  N^ib))  be  the  set  of  triples  which 
could  occur  in  the  next  ID  at  the  same  position  as  b,  assuming  that  the  first  (resp.,  second) 
move  is  taken.  Abbreviating  #  =  {#f ,  #“,  #1,  #?}  and  #c  =  {#^  #“},  the  modified  Ek  is 

=  U  U  U  (2lk]3.D-2{A:l3-#)* 

<ie£)6€Ace{l,2} 

•(2[lk]3  •  d  •  2(lk]3  •  b  •  2(ifc]3  ■  D  •  2[A:]3  •  #c  •  2(jk]3  •  d  ■  2[fc]3  •  ^A  -  Ar,(6)) ) 
•(2[k]3  •  D  •  2[k]Z  ■  #)•  . 

Let  E'  denote  the  expression  E  after  these  modifications,  i.e., 

Now  an  interleaving  PDL  formula  (p  is  constructed  so  that  ip  is  satisfiable  iff  M  accepts 
X.  The  length  of  <p  is  0(p(n)  log  n).  The  set  of  atomic  programs  is  the  alphabet  S  used  in 
E'  U  F',  and  there  is  one  atomic  formula  P.  A  model  of  PDL  is  essentially  a  directed  multi- 
graph  (i.e.,  there  can  be  multiple  edges  between  two  nodes),  where  each  edge  (transition) 
is  labeled  with  an  element  of  £,  and  each  node  (state)  is  labeled  either  P  or  ->P  depending 
on  whether  P  is  true  or  false  at  that  state.  The  graph  has  a  distinguished  state  u,  the  state 
where  ip  is  true.  Since  all  that  matters  about  a  model  is  its  reachability  structure  with 
respect  to  labeled  paths,  it  is  useful  to  imagine  that  the  graph  has  been  “unwcund”  into  a 
directed  tree  rooted  at  u  with  all  paths  directed  away  from  u.  The  idea  is  that  the  portion 
of  the  tree  where  P  is  true  should  contain  an  accepting  computation  tree  of  M  on  input  x. 
That  is,  (1)  if  p  is  a  directed  path  which  starts  at  u  and  terminates  at  the  point  where  P 
first  becomes  false,  then  p  should  represent  a  valid  computation  path  of  M  (more  precisely, 
the  sequence  of  atomic  programs  labeling  the  path  p  should  represent  a  valid  computation 
path),  and  (2)  for  each  universal  ID  in  the  computation  tree,  both  successors  of  this  ID 
should  be  in  the  computation  tree. 

As  usual,  let  [a]p  abbreviate  ->{a)-<p-,  the  intuitive  meaning  of  [a]p  is  "all  ways  of  numing 
a  reach  a  state  where  p  is  true.” 

^  is  a  conjunction  of  several  components  described  next.  IVaLnslations  into  formal  PDL 
are  also  given. 
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1.  P  is  true  at  u. 
Translation:  P. 


2.  For  every  state  w  reachable  from  u,  it  is  possible  to  reach  from  w  a  state  where  P  is 
false. 

Translation:  [E*|((E*)^P). 

(Note:  Using  E*  to  define  “reachable”,  a  state  is  always  reachable  from  itself.) 

3.  There  are  no  invalid  computation  paths  starting  at  u.  That  is,  it  is  impossible  to  run 
E'  U  F'  from  u  to  reach  a  stat  j  where  P  is  false. 

Translation:  ->{E')->P  A  ->{F*)->P. 

(Note:  We  break  this  formula  mto  a  conjunction,  one  part  for  E'  and  one  part  for  F', 
to  obtain  a  top-level  concurrent  formula.) 

4.  Both  possible  moves  must  be  taken  after  every  universal  ID.  That  is,  for  every  state 
w  reachable  from  u  and  every  c  €  {1,2},  if  it  is  possible  to  run  s((#c)"‘)  from  w  then 
it  is  possible  to  run  p((#3_c)'”)  from  w. 

Translation:  Ac6{i.2}  (Sl(((5((#“)”‘))  true)  ((5((#lf_c)”‘))  true)). 
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Remarks. 

(1)  By  the  remark  following  Theorem  2.1,  two  atomic  programs  suflBce  to  prove  Theorem  3.1. 

(2)  The  atomic  formula  P  in  the  construction  can  be  replaced  by  the  formula  ((E)  true), 
which  is  true  at  a  state  w  iff  there  is  some  transition  out  of  state  w.  That  is,  computation 
paths  terminate  at  states  which  have  no  outgoing  transitions.  So  Theorem  3.1  remains  true 
for  formulas  containing  no  atomic  formulas. 

(3)  It  follows  from  [8]  that  an  upper  bound  on  the  complexity  of  IPDL  is  deterministic  time 
2^‘'"  for  some  constant  d  >  0.  There  is  a  gap  between  this  upper  boimd  and  the  lower 
bound  of  Theorem  3.1. 

(4)  2-EXPTIME-completeness  holds  also  for  deterministic  PDL  (DPDL)  with  interleaving. 
In  DPDL,  models  {W,  ir,  t)  are  restricted  to  those  such  that,  for  every  atomic  program  o,  if 
(u,  v)  €  r(a)  and  (u,  w)  €  r(a)  then  v  =  w.  (This  does  not  have  to  hold  for  non-atomic  a, 
however.)  The  2-EXPTIME  upper  bound  follows  again  from  (8].  We  need  only  deterministic 
atomic  programs  to  prove  the  lower  bound. 
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